Origin of entropy convergence in hydrophobic hydration and protein folding 
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An information theory model is used to construct a molecular explanation why hydrophobic sol- 
vation entropies measured in calorimetry of protein unfolding converge at a common temperature. 
The entropy convergence follows from the weak temperature dependence of occupancy fluctuations 
for molecular-scale volumes in water. The macroscopic expression of the contrasting entropic be- 
havior between water and common organic solvents is the relative temperature insensitivity of the 
water isothermal compressibility. The information theory model provides a quantitative descrip- 
tion of small molecule hydration and predicts a negative entropy at convergence. Interpretations of 
entropic contributions to protein folding should account for this result. 
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High sensitivity calorimetry on the unfolding of globular proteins has suggested that hydrophobic contributions 
to the entropies of unfolding converge to zero near 385 K 0-^]. Additional thermodynamic information on protein 
folding processes is then obtained by extrapolating the calorimetric measurements to the convergence temperature 
where hydrophobic contributions vanish. The convergence behavior of entropies of solution of hydrocarbons in water 
is known also ^ -[)). Yet a microscopic-level mechanism for this phenomenon has not been offered |fro| . Here we 
identify a mechanism by analysis of an information theory model of hydrophobic hydration p| , |l2| . We show how the 
theory predicts entropy convergence on the basis of the density and density fluctuations of liquid water. Consistent 
with experimental results on hydrophobic hydration, the model predicts that the entropy at convergence should be 
negative. 

Aqueous protein solutions are complex systems involving molecular interactions of several kinds, including those 
associated with ionic and polar groups, in addition to hydrophobic contributions [gj. For clarity we focus on strictly 
hydrophobic species and idealize those solutes as hard core objects that perfectly repel water molecule centers identified 
as the position of the oxygen atoms. Statistical mechanics relates the excess chemical potential of hard core solutes 
to the probability, po, of finding an empty volume, v, or a cavity of a given size and shape in water, 

Ap ex = -kT\n Po . (1) 

We calculate po by considering the probabilities, p n , of observing exactly n solvent centers in the cavity. The p n 



are predicted by maximizing an information entropy 11 T^], subject to the constraints of available information. The 
experimentally accessible first and second moments of the number of solvent centers in the cavity region constitute the 
generally available information. This procedure yields here the distribution p n = exp(Ao + Ain + A2n 2 ) where Ao, Ai, 
and A2 are Lagrange multipliers. The required moments are obtained from the number density p and oxygen-oxygen 
radial distribution function g(r) by 

(n) = pv, (2) 
(n(n - 1)) = p 2 [ dr [ dr'g(\r - r'|). (3) 



That this is an accurate model for the circumstances considered here has been explicitly verified pl|Jl2] |. 

Figure |l| shows the calculated Apf x for spherical solutes as a function of temperature along the saturation curve of 
liquid water. For comparison, chemical potentials calculated directly from simulation using test particle insertions |T^ 1 
are also shown. The quantitative agreement between the two methods is excellent over the entire temperature range. 
The chemical potential increases with temperature past 400 K but eventually decreases. The maximum in chemical 
otential occurs at about the same temperature in each case. These curves have the same shape as experimental ones 
1| for inert gases dissolved in water but shifted upward due to the use of a hard sphere model. 
Entropies calculated as the temperature derivative of Ap ex along the saturation curve are shown in Figure |[ As 
expected, these entropies are large and negative at room temperature, and increase with temperature. The entropies 
of hydration for these solutes converge in a temperature region about 400 K, close to the temperature at which they 
are zero. The observed entropy convergence for transfer of simple nonpolar species from the dilute gas to water 
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Jl6[ is similar. Since water densities and 0-0 radial distribution functions constitute the only input to our model, 
the behavior shown in Figures [l] and |^ must arise from these properties of liquid water. We therefore examine the 
connection between these properties and the solvation free energies. 

A simplification of the two moment model gives an explicit expression connecting the chemical potential to the 
density and density fluctuations of liquid water: 

Afj, ex « Tp 2 {kv 2 /2(T 2 }+T{k\n{2iT(j 2 )/2}. (4) 



This is obtained from the Gaussian estimate p n s» exp(— 5n 2 /2a 2 )/ V 2-ko 2 where 8n = n — (n) , and a 2 = (Sn 2 ). The 
Gaussian formula is consistent with the Pratt-Chandler JlTj theory. See ]ll] , |l2| , |l8| . Since the second term of eqn. ^ 
is smaller than the first and is only logarithmically sensitive to the size of the solute, this relationship says physically 
that the solvation free energy may be lowered by decreasing the density or the temperature of the solvent [the Tp 2 
factor] ; or by enhancing the ability of the solvent to open cavities of a size necessary to accommodate the solute [the 
a 2 factor in the first term] . Along the saturation curve in Figure [l], the combination Tp 2 (T) exhibits a nonmonotonic 
temperature dependence. Surprisingly, a 2 (T,v) has a negligible dependence on the temperature over that range, so 
that 

Ap™ *Tp 2 (T)x(v)+Ty(v). (5) 

The quantities x(v) and y(v), defined by the correspondence between eqns ^ and |5|, depend only on the excluded 
volume of the solute, not on the temperature. Because the T and v dependence is important in both terms of 
eqn. ^ we require a generalization of the arguments | ^J^] based upon the hypothesis of convergence at zero entropy. 
Discussion of the more general argument is illustrated by the schematic Figure ||. To begin, note [Figure ||(a)] that if 
Ty(v) were neglected in eqn. ^| a precise convergence at zero entropy would be observed; this follows from previous 
arguments |^|,^). But Ty{v) is present, in fact, and adds — fcln(27rcr 2 )/2 to the entropies. Since this contribution varies 
only logarithmically with v, it provides [Figure §(b)] a downward shift, approximately the same for each entropy 
curve. An accurate convergence is obtained at the entropy — k\n(2ira 2 )/2. Finally, if we include the neglected 
temperature dependences by obtaining the temperature derivatives of eqn. |l|, the convergence is further shifted to 
lower temperatures and entropies and is further blurred as shown in Figure 0(c). 

In contrast, for nonpolar liquid solvents we anticipate a stronger temperature dependence for a 2 (T,v) in addition 
to the nonmonotonic temperature dependence of Tp 2 (T). The temperature dependences of both Tp 2 (T) and a 2 (T, v) 
would be needed to describe the variation of the chemical potential along the coexistence curve. Eqn. ^| would not 
apply and entropy convergence would not be expected. The macroscopic expression of this contrasting behavior 
is the relative temperature insensitivity of the water isothermal compressibility compared to hydrocarbon liquids; in 
comparison to benzene, normal alkanes, and carbon tetrachloride, the isothermal compressibility of liquid water varies 
only weakly along the saturation curve up to 450 K jl9) . 

In conclusion, the temperature maximum in chemical potentials for hydrophobic hydration and the entropy con- 
vergence both follow directly from the weak temperature dependence of occupancy fluctuations a 2 (T,v) = {Sn 2 } for 
solute excluded volumes in water. Further, the value of the entropy at convergence is negative. Interpretations of 
entropic contributions to protein folding should account for this result. 

The excellent agreement shown in Figure [l] between the test particle insertion results and those obtained from the 
information theory model demonstrates that a Gaussian distribution adequately represents molecular-scale density 
fluctuations in liquid water. This ability to relate physical features of water structure to the observed thermodynamics 
of hydrophobic hydration holds promise for extending the information theory modeling to proteins and protein-ligand 
complexes in aqueous solutions. 
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FIG. 1. Excess chemical potentials A/i ex of model hard sphere solutes of sizes roughly comparable to Ne, Ar, methane 
(Me), and Xe as a function of temperature. The hard sphere diameters used were 2.8A, 3.1 A, 3.3A, and 3.45A, respectively. 
The hard sphere diameter is then kept constant at various temperatures. The effects of more general solute-solvent interactions 
can be included subsequently ^|. The lines indicate the information theory model results and the symbols are the values 
calculated using the test particle insertion method [Q. Typical error bars in the test particle insertion method are indicated 
for each solute. The required second moments were obtained from oxygen-oxygen radial distribution functions calculated from 
Monte Carlo simulations of 256 simple point charge (SPC) flq] water molecules. 



FIG. 2. — (dA/j, ex /dT) t along the saturation curve of liquid water for model hard sphere solutes of sizes comparable to Ne, 
Ar, methane (Me), and Xe as a function of temperature. Note that the convergence point would be more sharply defined if the 
Ne results were excluded. Analysis of solubility data from different sources jl6| locates the convergence temperature for Ne, Ar, 
methane, and Xe in the neighborhood of 400 K. This convergence region seems to move systematically to lower temperatures 
and entropies as data on smaller solutes are excluded, becoming more consistent with the values assumed in protein unfolding 
experiments. Closely examined, the width of the entropy convergence region is several times 10 K for inert gas solubilities. 
Additional, equation of state contributions to the standard solvation entropy are negligible: \(dfi ox /dT) p — (dfj°* /dT) sat \ < 1 
and < 10 (J/mol)/K for temperatures T < 450 and < 550 K, respectively. 



FIG. 3. — (dA(i ex /dT) sat along the saturation curve of liquid water, as in Figure |2| but schematically, (a) Contribution to 
the entropy from the Tp 2 (T)x(v) term of eqn. |^. This contribution dominates the eqn. ^ (b) Sum of the contributions from 
both the terms in eqn. H. The constancy of a 2 (T,v) with temperature is assumed. For the hard sphere models considered here 
the downward shift is about 7-9 J/mol/K. (c) Entropies calculated from eqn. ^ accounting for the temperature dependence 
of a 2 (T,v). For the hard sphere model solutes the leftward shift is roughly 40-50 K and the curves are shifted downward by 
another 3-7 J/mol/K, approximately. The total downward shift of 10-16 J/mol/K is about a quarter of the entropy at room 
temperature; see Figure H. 
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